智能论文笔记

Machine Learning-Based COVID-19 Patients Triage Algorithm using Patient-Generated Health Data from Nationwide Multicenter Database

Hyung Ju Hwang , Seyoung Jung , Min Sue Park , Hyeontae Jo , Haeun Lee

分类：机器学习 | 人工智能

2021-09-18

迅速严重性评估患有传染病感染的确诊患者的评估模型可以实现高效的诊断和减轻医疗系统的负担。本文利用机器学习技术提供了严重性评估模型的开发过程及其在SARS-COV-2患者的应用。在这里，我们强调我们的模型只需要基本患者的基本个人数据，从而允许他们判断自己的严重程度。我们选择了基于升级的决策树模型作为分类器，并将死亡率解释为建模后的概率分数。具体而言，使用贝叶斯优化技术调整确定树模型结构的超参数，而不知道医疗信息。结果，我们测量了模型性能并识别通过模型影响严重性的变量。最后，我们的目标是建立一个允许患者检查自己的严重性的医疗系统，并根据其他患者的过去的治疗细节来访问他们访问适当的诊所中心。

translated by 谷歌翻译

Critic-Guided Decoding for Controlled Text Generation

Minbeom Kim , Hwanhee Lee , Kang Min Yoo , Joonsuk Park , Hwaran Lee , Kyomin Jung

分类：自然语言处理

2022-12-21

Steering language generation towards objectives or away from undesired content has been a long-standing goal in utilizing language models (LM). Recent work has demonstrated reinforcement learning and weighted decoding as effective approaches to achieve a higher level of language control and quality with pros and cons. In this work, we propose a novel critic decoding method for controlled language generation (CriticControl) that combines the strengths of reinforcement learning and weighted decoding. Specifically, we adopt the actor-critic framework to train an LM-steering critic from non-differentiable reward models. And similar to weighted decoding, our method freezes the language model and manipulates the output token distribution using called critic, improving training efficiency and stability. Evaluation of our method on three controlled generation tasks, namely topic control, sentiment control, and detoxification, shows that our approach generates more coherent and well-controlled texts than previous methods. In addition, CriticControl demonstrates superior generalization ability in zero-shot settings. Human evaluation studies also corroborate our findings.

translated by 谷歌翻译

Compliant Suction Gripper with Seamless Deployment and Retraction for Robust Picking against Depth and Tilt Errors

Yuna Yoo , Jaemin Eom , Min Jo Park , Kyu-Jin Cho

分类：机器人

2022-11-29

Applying suction grippers in unstructured environments is a challenging task because of depth and tilt errors in vision systems, requiring additional costs in elaborate sensing and control. To reduce additional costs, suction grippers with compliant bodies or mechanisms have been proposed; however, their bulkiness and limited allowable error hinder their use in complex environments with large errors. Here, we propose a compact suction gripper that can pick objects over a wide range of distances and tilt angles without elaborate sensing and control. The spring-inserted gripper body deploys and conforms to distant and tilted objects until the suction cup completely seals with the object and retracts immediately after, while holding the object. This seamless deployment and retraction is enabled by connecting the gripper body and suction cup to the same vacuum source, which couples the vacuum picking and retraction of the gripper body. Experimental results validated that the proposed gripper can pick objects within 79 mm, which is 1.4 times the initial length, and can pick objects with tilt angles up to 60{\deg}. The feasibility of the gripper was verified by demonstrations, including picking objects of different heights from the same picking height and the bin picking of transparent objects.

translated by 谷歌翻译

MoDA: Map style transfer for self-supervised Domain Adaptation of embodied agents

Eun Sun Lee , Junho Kim , SangWon Park , Young Min Kim

分类：机器人 | 计算机视觉

2022-11-29

We propose a domain adaptation method, MoDA, which adapts a pretrained embodied agent to a new, noisy environment without ground-truth supervision. Map-based memory provides important contextual information for visual navigation, and exhibits unique spatial structure mainly composed of flat walls and rectangular obstacles. Our adaptation approach encourages the inherent regularities on the estimated maps to guide the agent to overcome the prevalent domain discrepancy in a novel environment. Specifically, we propose an efficient learning curriculum to handle the visual and dynamics corruptions in an online manner, self-supervised with pseudo clean maps generated by style transfer networks. Because the map-based representation provides spatial knowledge for the agent's policy, our formulation can deploy the pretrained policy networks from simulators in a new setting. We evaluate MoDA in various practical scenarios and show that our proposed method quickly enhances the agent's performance in downstream tasks including localization, mapping, exploration, and point-goal navigation.

translated by 谷歌翻译

Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching

Yeonhong Park , Sunhong Min , Jae W. Lee

分类：机器学习

2022-08-19

最近，Graph神经网络（GNNS）已成为聚光灯作为强大的工具，可以有效地在图形结构化数据上执行各种推理任务。随着现实图表的大小继续扩展，GNN训练系统面临可扩展性挑战。分布式培训是一种流行的方法，可以通过扩展CPU节点来应对这一挑战。但是，对基于磁盘的GNN培训的关注不多，该培训可以通过利用NVME SSD等高性能存储设备来以更具成本效益的方式扩展单节点系统。我们观察到，主内存和磁盘之间的数据移动是基于SSD的训练系统中的主要瓶颈，并且常规的GNN训练管道是不错的选择，而无需考虑此开销。因此，我们提出了Ginex，这是第一个基于SSD的GNN训练系统，可以在单台计算机上处理数十亿个图形数据集。受到编译器优化的检查员执行模型的启发，Ginex通过分开样品和收集阶段来重组GNN训练管道。这种分离使Ginex能够实现一种可证明的最佳替换算法，即被称为Belady的算法，用于存储器中的Caching特征向量，该算法是I/O访问的主要部分。根据我们对40亿尺度图数据集的评估，Ginex平均比SSD扩展的Pytorch几何得出了2.11倍的训练吞吐量（最大最高2.67倍）。

translated by 谷歌翻译

Non-Autoregressive Sign Language Production via Knowledge Distillation

Eui Jun Hwang , Jung Ho Kim , Suk min Cho , Jong C. Park

分类：机器学习 | 自然语言处理

2022-08-12

手语制作（SLP）旨在将语言的表达方式转化为手语的相应语言，例如基于骨架的标志姿势或视频。现有的SLP型号是自动回旋（AR）或非自动入口（NAR）。但是，AR-SLP模型在解码过程中遭受了回归对均值和误差传播的影响。 NSLP-G是一种基于NAR的模型，在某种程度上解决了这些问题，但会带来其他问题。例如，它不考虑目标符号长度，并且会遭受虚假解码启动的影响。我们通过知识蒸馏（KD）提出了一种新型的NAR-SLP模型，以解决这些问题。首先，我们设计一个长度调节器来预测生成的符号姿势序列的末端。然后，我们采用KD，该KD从预训练的姿势编码器中提取空间语言特征以减轻虚假解码的启动。广泛的实验表明，所提出的方法在特里切特的手势距离和背面翻译评估中都显着优于现有的SLP模型。

translated by 谷歌翻译

On the Importance of Critical Period in Multi-stage Reinforcement Learning

Junseok Park , Inwoo Hwang , Min Whoo Lee , Hyunseok Oh , Minsu Lee , Youngki Lee , Byoung-Tak Zhang

分类：人工智能 | 机器学习 | 神经与进化计算

2022-08-09

婴儿生命的最初几年被称为关键时期，在此期间，由于神经可塑性，学习绩效的总体发展受到显着影响。在最近的研究中，具有深层神经网络模仿实际神经元的深层神经网络的AI药物表现出与人类关键时期类似的学习期。特别是在此初期，适当的刺激在发展学习能力中起着至关重要的作用。但是，将人类的认知偏见转变为适当的塑造奖励是非常具有挑战性的，并且在关键时期的先前工作并不集中于寻找适当的刺激。为了进一步迈出一步，我们建议多阶段的增强学习强调在关键时期发现``适当的刺激''。受到人类早期认知发展阶段的启发，我们在关键时期附近使用多阶段的指导，并证明就AI代理的性能，效率和稳定性而言，适当的成型奖励（2阶段指导）。

translated by 谷歌翻译

PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation

Kwonyoung Kim , Jungin Park , Jiyoung Lee , Dongbo Min , Kwanghoon Sohn

分类：计算机视觉 | 机器学习

2022-07-27

在线立体声适应解决了由合成（训练）和真实（测试）数据集之间的不同环境引起的域移位问题，以迅速适应动态现实世界应用程序（例如自动驾驶）中的立体声模型。但是，以前的方法通常无法抵消与动态物体有关的特定区域，并具有更严重的环境变化。为了减轻此问题，我们建议将辅助点选择性网络纳入称为PointFix的元学习框架中，以提供对在线立体声适应的立体声模型的强大初始化。简而言之，我们的辅助网络学会通过通过元级别有效地反向传播局部信息来固定局部变体，从而实现基线模型的稳健初始化。该网络是模型 - 不合时宜的，因此可以以任何插件的方式以任何形式的架构使用。我们进行了广泛的实验，以在三个适应设置（例如短期，中和长期序列）下验证我们的方法的有效性。实验结果表明，辅助网络对基本立体声模型的适当初始化使我们的学习范式在推理时达到了最新的性能。

translated by 谷歌翻译

Grounding Visual Representations with Texts for Domain Generalization

Seonwoo Min , Nokyung Park , Siwon Kim , Seunghyun Park , Jinkyu Kim

分类：计算机视觉 | 自然语言处理 | 机器学习

2022-07-21

减少源和目标域之间的表示形式差异是最大化模型概括的关键组件。在这项工作中，我们倡导利用自然语言监督域的概括任务。我们将两个模块介绍给地面视觉表示，其中包含人类典型推理的文本：（1）视觉和文本关节嵌入器以及（2）文本解释发生器。前者学习图像文本的关节嵌入空间，我们可以将高级类别歧视性信息接地到模型中。后者利用了一个可解释的模型，并生成了解释，证明其决定背后的理由是合理的。据我们所知，这是为域泛化任务利用视觉和语言跨模式方法的第一项工作。我们使用新创建的CUB-DG基准数据集进行的实验表明，可以成功地将跨模式监督用于接地域不变的视觉表示并改善模型的概括。此外，在大规模域基准测试中，我们提出的方法可实现最先进的结果，并在五个多域数据集的平均性能中排名第一。数据集和代码可在https://github.com/mswzeus/gvrt上找到。

translated by 谷歌翻译

A Data-Based Perspective on Transfer Learning

Saachi Jain , Hadi Salman , Alaa Khaddaj , Eric Wong , Sung Min Park , Aleksander Madry

分类：机器学习

2022-07-12

人们普遍认为，在传输学习中，包括更多的预训练数据可以转化为更好的性能。但是，最近的证据表明，从源数据集中删除数据实际上也可以提供帮助。在这项工作中，我们仔细研究了源数据集在转移学习中的作用，并提出了探索其对下游性能的影响的框架。我们的框架产生了新的功能，例如精确转移学习脆弱性以及检测诸如数据渗漏等病理和源数据集中存在误导示例之类的病理。特别是，我们证明，消除通过框架确定的有害数据点可改善来自ImageNet的转移学习绩效，以了解各种目标任务。代码可从https://github.com/madrylab/data-transfer获得

translated by 谷歌翻译